A Bayesian Wilcoxon signed-rank test based on the Dirichlet process
Abstract
Bayesian methods are ubiquitous in machine learning. Nevertheless, empirical results are typically analyzed with frequentist tests. This means dealing with null hypothesis significance tests and p-values, even though the shortcomings of such methods are well known. We propose a nonparametric Bayesian version of the Wilcoxon signed-rank test using a Dirichlet process (DP) based prior. We address in two different ways the problem of choosing the infinite-dimensional parameter that characterizes the DP. The proposed test has all the traditional strengths of the Bayesian approach; for instance, unlike frequentist tests, it allows verifying the null hypothesis, not only rejecting it, and making decisions that minimize the expected loss. Moreover, one of the proposed solutions for modeling the infinite-dimensional parameter of the DP makes it possible to isolate instances in which the traditional frequentist test is guessing at random. We show results on the comparison of two classifiers using real and simulated data.
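As a rough illustration of the idea in the abstract, the following Python sketch estimates a posterior probability for a signed-rank style comparison by Monte Carlo. It is not the authors' implementation. The function name `bayesian_signed_rank`, the parameters `prior_strength` and `pseudo_obs`, the choice of a single pseudo-observation at zero as the DP base measure, and the half-weight treatment of ties are all assumptions made for this example.

```python
import numpy as np


def bayesian_signed_rank(diffs, prior_strength=0.5, pseudo_obs=0.0,
                         n_samples=20000, seed=0):
    """Monte Carlo estimate of P(theta > 1/2 | data).

    theta is taken here to be P(Z + Z' > 0) for two independent draws from
    the unknown distribution of paired differences (an assumed formulation).
    """
    rng = np.random.default_rng(seed)

    # Prepend the hypothetical prior pseudo-observation to the observed
    # paired differences (e.g. accuracy of classifier A minus classifier B).
    z = np.concatenate(([pseudo_obs], np.asarray(diffs, dtype=float)))

    # Posterior weights: Dirichlet with mass `prior_strength` on the
    # pseudo-observation and mass 1 on each observed difference.
    alpha = np.concatenate(([prior_strength], np.ones(len(z) - 1)))
    weights = rng.dirichlet(alpha, size=n_samples)  # shape (n_samples, n + 1)

    # Indicator of z_i + z_j > 0; exact ties count one half (an assumption).
    pair_sums = z[:, None] + z[None, :]
    indicator = (pair_sums > 0).astype(float) + 0.5 * (pair_sums == 0)

    # theta for each posterior draw is the quadratic form w^T A w.
    theta = ((weights @ indicator) * weights).sum(axis=1)

    # Fraction of posterior draws in which theta exceeds one half.
    return float(np.mean(theta > 0.5))


if __name__ == "__main__":
    example_diffs = [0.02, 0.01, -0.005, 0.03, 0.015, 0.0, 0.01]
    print(bayesian_signed_rank(example_diffs))
```

A returned value close to 1 would suggest that the first classifier tends to score higher, a value close to 0 the opposite, and intermediate values leave the comparison undecided. Any decision threshold would depend on the loss function one has in mind.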
Similar resources
Imprecise Dirichlet process with application to the hypothesis test on the probability that X ≤ Y
The Dirichlet process (DP) is one of the most popular Bayesian nonparametric models. An open problem with the DP is how to choose its infinite-dimensional parameter (base measure) in case of lack of prior information. In this work we present the Imprecise DP (IDP)—a prior near-ignorance DP-based model that does not require any choice of this probability measure. It consists of a class of DPs ob...
A Generalization of Wilcoxon Rank Sum Test
In this paper, we consider applying Wilcoxon's idea for the construction of the one-sample problem to the two-sample case. For this, we show that the location translation parameter becomes a median of the distribution of the difference of two independent random variables under the location translation model. Based on this fact, we construct generalized signed-rank statistics for the two sam...
Bayesian Comparison of Machine Learning Algorithms on Single and Multiple Datasets
We propose a new method for comparing learning algorithms on multiple tasks which is based on a novel non-parametric test that we call the Poisson binomial test. The key aspect of this work is that we provide a formal definition of what it means for one algorithm to be better than another. Also, we are able to take into account the dependencies induced when evaluating classifiers on the s...
Wilcoxon signed rank test for imprecise observations
This article extends the Wilcoxon signed rank test for testing whether the median of a population is a specified constant by treating the observations as imprecise values. The test procedure is developed by using the concepts of “Credibility Theory” for studying the behavior of fuzzy phenomena. Numerical illustration of the proposed test is provided by representing the observations as trapezoid...